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General linear electrodynamics allow for an arbitrary linear constitutive relation between 
the field strength two-form and induction two-form density if crucial hypcrbolicity and energy 
conditions are satisfied, which render the theory predictive and physically interpretable. 
Taking into account the higher-order polynomial dispersion relation and associated causal 
structure of general linear electrodynamics, we carefully develop its Hamiltonian formulation 
from first principles. Canonical quantization of the resulting constrained system then results 
in a quantum vacuum which is sensitive to the constitutive tensor of the classical theory. As 
an application we calculate the Casimir effect in a bi-refringent linear optical medium. 



1. INTRODUCTION 



Classical electromagnetism can be formulated on much more general optical backgrounds than 
the familiar ones described in terms of Lorentzian manifolds. From the point of view of electro- 
dynamics, this is because one merely needs a constitutive law that links the electromagnetic field 
strength two-form F with the induction two-form density and thus closes the relations 

{dF)al3-y = 0, {dH)ai3-y = Ca^^sj^ , (1) 

which any electromagnetic theory featuring charge conservation and no magnetic monopoles in four 
dimensions must satisfy in the presence of a current vector field density j. This point has been 
made most prominently and lucidly by pp. Even if one restricts attention to linear constitutive 
laws [38j, newthe resulting electrodynamic theories will generically feature birefringence, meaning 
that distinguished polarizations of light will travel at different speeds. Now the most general action 
for an electromagnetic gauge potential that results in a linear constitutive law, and which we will 
carefully quantize in this paper, is 



S[A, G] = -\j dx^UG [Fo^pF^s G"^^-^ + fAj^ , (2) 



where G is a smooth covariant rank four tensor field with the symmetries Gap^s = G^sap and 
Gai3"/S = —GjSa-yS, and which is invertible in the sense that there is a smooth contravariant tensor 



2 



field G""^^^ so that G'^^P^Gp^./S = 2{6^5^ - S^d^) and there is a well-defined volume form for 
such area metric tensors [2]. The birefringence of such general linear electrodynamics is encoded 
in its dispersion relation, or equivalently the causal structure, of the associated field equations 
|3]. This dispersion relation is known [4J to be of higher polynomial order, and indeed the central 
challenge faced in this paper is to properly deal with this fact, both in the classical and quantum 
analysis. The importance of understanding Maxwell theory on such general linear backgrounds is 
that the latter comprehensively describe all linear optical backgrounds ranging from fundamental 
spacetime geometries beyond Lorentzian geometry O [3HH] over the effective spacetime structure 
seen by photons to first order quantum corrections in a curved Lorentzian spacetime [9j to all 
non-dissipative linear optical media available in the laboratory [8j. 

The present article develops the canonical quantization of these most general linear electro- 
dynamics from first principles, and arrives at an explicit calculation of the quantum vacuum of 
the theory. We show that the related Casimir effect detects deviations from a non-birefringent 
background with an amplification which in principle is limited only by technological constraints. 
Arriving at these results requires special care when obtaining the Hamiltonian formulation of the 
classical theory that precedes the actual quantization. While quite generally the Dirac-Bergmann 
quantization procedure of course also applies to these gauge field dynamics, the key issue is the 
question of which hypersurfaces provide viable initial data surfaces on which the canonical phase 
space variables can be defined and evolved by the Hamiltonian. It is precisely this question that 
makes the problem of quantization of the dynamics Q so subtle, and requires the conceptually 
robust understanding of its causal structure developed in [3J and concisely summarized in section 
[2j Only when using the insights gained there, can the formulation of the Hamiltonian picture in 
section |3] and the canonical quantization in section [7| proceed as usual, based on the derivation 
of the Dirac brackets in section [4] and the diagonalization of the Hamiltonian in section [6j which 
is particularly simple for the area metrics in a neighborhood of Lorentzian metric geometries, as 
shown in section [5} However, having gone through the laborious quantization procedure, one is re- 
warded in section [8] with the said method to measure deviations from a metric-induced background 
through the Casimir effect in particular, and a demonstration of how to quantize field theories with 
higher-order polynomial dispersion relations |10ft27j in general. 

For simplicity, we restrict attention to flat area metric manifolds throughout the paper. Anal- 
ogous to any other geometric structure on a smooth manifold, an area metric is called fiat if there 
exists a set of charts covering the underlying smooth manifold such that the components of the 
area metric tensor are constant within each such chart. 
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2. CAUSUAL STRUCTURE OF LINEAR ELECTRODYNAMICS 

The Hamiltonian formulation of the dynamics ([2]), on which the canonical quantization will 
be built, hinges on several key results of the associated causality theory. Here we summarize the 
central results of practical importance. For a detailed derivation of these results we refer the reader 
to [3j. A necessary condition for Maxwell theory on a four-dimensional area metric background to 
be predictive is that the following polynomial [1] on covectors k, 

is hyperbolic. This means that there is at least one covector h with P{h) 7^ such that for every 
covector q the polynomials 

Pg,h{X) = P{q + Xh) (4) 

have only real roots A, in which case h is said to be a hyperbolic covector with respect to P. That 
hyperbolicity is a necessary criterion for a well-posed initial value problem is a central result of the 
theory of partial differential equations |28| I29j. For the flat area metric manifolds discussed here, 
the hyperbolicity of P is even sufficient for the predictivity of the theory [28]. Initial data, given 
on a hypersurface whose normal covectors are all hyperbolic with respect to P, are then uniquely 
evolved away from the hypersurface. Thus a Hamiltonian formulation of the dynamics, which deals 
precisely with the evolution between initial data surfaces, must be based on a foliation {t, a;"} of 
the manifold whose leaves t = const are hypersurfaces with hyperbolic conormal. 

However, this requirement needs to be sharpened if one requires that the actual initial data can 
be collected by observers. The definition of observers now hinges on the so-called dual polynomial 
[30] , which for those P that arise from area metrics by virtue of Q and which admit hyperbolic 
covectors, can be calculated explicitly and takes the deceivingly simple form 

which sends any tangent vector X to a real number. That the dual polynomial can be calculated 
analytically at all, and takes such a comparatively simple form, is only due to an interplay of the 
area metric structure underlying it and the necessary hyperbolicity of P . While the hyperbolic 
covectors of P distinguish admissible initial data surfaces, admissible observers are distinguished by 
their worldline tangent vectors being hyperbolic vectors of P*. In other words, the very existence 
of observers restricts the admissible area metric geometries further to those where also P^ is 
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hyperbolic. But exactly this hyperbolicity of then allows to make a choice of time-orientation, 
which in turn implies a choice of positive energy. More precisely, a time-orientation is chosen by 
picking one connected set of all hyperbolic tangent vectors, a so-called hyperbolicity cone out 
of the several such connected components defined by P*. But then the covectors q for which all 
future-directed observers measure positive energy, q{X) > for all X G C^, themselves constitute 
a cone (C*)"'' in cotangent space, which thus deserves to be called the positive energy cone with 
respect to the chosen time-orientation. The latter, in turn, selects the (open and convex) cone C of 
hyperbolic covectors of P that lie within the positive energy cone (C*)"''. For technical convenience 
we require, without loss of generality, that P be positive on all of C; indeed, from ([s]) it is clear 
that this always can be arranged for by switching the overall sign of G. 

Besides the hyperbolicity of P and P*, one finally needs to require that there exists a time 
orientation such that any non-zero P-null covector lies either in (C*)"*" or —{C^)^. In other words, 
the energy of any massless momentum is to have a definite sign upon which all observers agree. If 
and only if this bi- hyperbolicity and energy distinguishing properties are met, it is justified to call 
the underlying area metric manifold an area metric spacetime, and we will consider only such. For 
an illustration in a typical case, see figure [l} and for a detailed exposition of these concepts, see 

m 

The final piece of technology concerns the duality map between covectors and vectors in an area 
metric spacetime. The map 

L:C^ L{C) , L{q) = (6) 

is shown in [3] to be a well-defined and invertible Legendre map precisely because P is assumed 
to be bi-hyperbolic and energy-distinguishing. Spacelike hypersurfaces are meaningfully defined 
as those having tangent directions that are purely spatial with respect to some observer. More 
precisely, the spacelike hypersurfaces are those whose conormals lie in L~^(C'^). But since it can 
be shown that L^^(C*) always lies within C, the condition that a hypersurface be spacelike (and 
thus initial data on it accessible by local observers) further sharpens the condition for a feasible 
initial data surface for the dynamics Q we identified before. Thus only a foliation {t ^) of 

an area metric spacetime into spacelike hypersurfaces t = const (which then contain all vectors that 
are annihilated by the covector L^^{d/dt), see figure [l| provides an appropriate temporal-spatial 
split for the Hamiltonian formulation. For the flat area metric spacetimes considered here, one may 
further choose the coordinates such that L~^{d/dt) is conormal to the spacelike hypersurfaces. In 
other words, one can choose trivial shift and lapse in the flat case. 
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cotangent space tangent space 



FIG. 1: Causal structure of a typical bi-hyperbolic and energy-distinguishing polynomial dispersion relation. 
Dotted surfaces indicate sets of covectors and vectors that are null with respect to the polynomial P and 
its dual P'^ , respectively. The cone C of hyperbolic covectors and the cone of observers both arise as 
hyperbolicity cones. Purely spatial directions, as seen by an observer with worldline tangent T, are those 
vectors annihilated by the preimage of T under the Legendre map L. 

The reader may find it helpful to get a feel for these seemingly abstract conditions for the 
special case where the area metric is induced by a metric g by virtue of Ga/s-yS = Qa-^gps — gasgp^y- 
Precisely the same conceptual steps force one then to take the metric g to be of Lorentzian signature 
(otherwise Maxwell theory would not be well-posed). Since in this metric-induced case P{k) = 
{g°^^kakp) as usual, we have L{C) = C*, and thus one recovers the standard Lorentzian notions of 
obervers and spacelike hyper surf aces. However, the general construction presented before does not 
justify itself from this reduction to the metric case. The general treatment rather demonstrates the 
appropriateness and consistency of the standard Lorentzian definitions from a conceptual point of 
view. 



3. HAMILTONIAN FORMULATION 

With the appropriate foliation (t, x^, x^, rr^) of the area metric spacetime with spacelike leaves 
for constant time t and conormals given by L~^{d/dt), as constructed in the previous section, 
we are now in a position to develop the Hamiltonian formulation of the dynamics encoded in 
the action For a flat area metric spacetime, one can choose coordinates not only such that 
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the area metric has constant components throughout those charts, but also that additionally the 
components of the volume form ujg featuring in the action are numerically identical to those of 
the totally antisymmetric Levi-Civita symbol eap^yS defined by £0123 = +1- In such a coordinate 
system, we obtain the canonical momenta associated with the field variables {A^^Ai) from the 
Lagrangian density C of the action ([2]) as 

vr° = jj^, = 0, (7) 
oyuQAi) 

where here, and for the remainder of the paper, latin indices range from 1 to 3, while greek indices 
continue to range from to 3. In the language of the theory for constrained systems |3H l32] , we 
thus identify 0i = vr*^ w as a primary constraint of the dynamics. Defining the matrix Mij with 
the property MijG^^^^ = 6j, whose existence is guaranteed if the differential equations coming 
from ([2]) are hyperbolic (see appendix [A|) , and using ([T]) to express doAi in terms of the canonical 
momenta tt*, we find the total Hamiltonian density 

n = -^MjsTT^n' - AodjTT^ - irmijG'^'''''drAk (8) 

+ ^G'^'''diAjdkAr - ^MirG'''^''G''^"'^dmAndjAk+Uiix)7T^ix). 

Following the Dirac-Bergmann algorithm [32j for obtaining the Hamiltonian formulation of systems 
with constraints, we now compute the commutator {vr'^, Ti}. If this commutator is not zero, we need 
to impose {vr^,^} ~ as a secondary constraint, in order to ensure that the primary constraint 

01 ~ is preserved under time evolution. Indeed, one obtains {ir^^'H} = —djir^. Thus we impose 

02 = dj-n^ ~ as a secondary constraint, which must be added to ([s]) with a corresponding 
Lagrange multiplier. The total Hamiltonian now reads 

n = no + uiix)TT^ix) + {U2ix) - Ao)dj7r^, (9) 

with 

no = -^M,,7r%^ - vr^Mi.G'O'^^^a.ylfc + - MirG'''^^G'''-"'^)dmAndjAk. (10) 

Now we find {(/)2,'H} = 0, so that the Dirac-Bergmann algorithm ends here and 0i ~ and 
02 ~ exhaust the constraints. However, {(j)i{t, x), (j)2{t,y)} = 0, so that 0i and 02 are first 
class constraints, implying that the multipliers ui{x) and U2{x) are completely undetermined. The 
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infinitesimal gauge transformations induced by ((/>!, </>2) on the canonical variables {Aa,Tr°') are 

5AUt,x) = j d^ye^it,y){A^{t,x),<j)iit,y)} = e\t,x)5^^-6idie\t,x), (11) 

57r-(t,x) = I d'ye'it,y){7r''{t,x),(^i{t,y)} = 0, (12) 

with 7 = 1,2 and e^(t, x) being the infinitesimal parameters of the transformations. Knowledge of 
these generators of gauge transformations allows us to identify classical observables of the theory 
as those functionals that are invariant under gauge transformations. Equivalently, observables 
commute with the constraints {0,(f)i} ~ 0. In the present case, it can be checked that the 
electromagnetic inductions 

jja ^ -G'^<^°''Fob - ^G^^''''Fbk (13) 



Ha — — -eOafec 
1 

.^Oabc 



'On^j - Uj/±o - ^ Ub^k 

GbcmO TP I ^ r^bcmn rp 



mO^^^ rmn 



(14) 

G'^^°id^Ao-doAm) + G'^^^d^Aj , 
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defined with respect to the chosen foliation of spacetime into spacelike hypersurfaces, indeed com- 
mute with the constraints, so that they can be used as observables. Thus we are finally able to 



write the Hamiltonian ( 10 ) for our system in terms of gauge-invariant observables and Ha as 



-Ho = ^UaiD^D' + ^V^'HaHi, (15) 



where the matrices U and V are given as 



U, 



al = -Mai + gTp,,feGO^^'=G°*P'?M,(,M,), (16) 

yal ^ _^^Ojkia^\0\l)pqrj.^^,^^ (j^) 
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with Tpqjfc defined such that 

The existence of T is guaranteed due to the invertibility properties of area metrics; indeed it can 
be written explicitly in terms of the block matrices constituting the area metric tensor, see again 
appendix |X| 

4. GAUGE FIXING AND DIRAC BRACKETS 



In order to determine the Dirac brackets associated with our system, one needs to remove the 
indeterminacy in the Lagrange multipliers by fixing a gauge. This is achieved here by manually 



8 



imposing two further constraints 03 0, 04 such that det{(?!)/(x), 0j(y)} ^ 0, with I^J = 
1, ... 4, so that the new set of constraints 0/ is now of second class. In our case, the Euler-Lagrange 
equations for the gauge field A obtained from the action ([2]) are given by 



which is conveniently split into one temporal equation 



and three spatial equations 



G^^^^da dc - G"'^^''5o da 



OaOfe £ 



da db - G""^'" ^0 a, 



OlOm 



Q0l0mQ2 _ 2GO(''")'^ao da + G^''""^da 8^ 



As the third constraint we impose the Glauber gauge 

03 = Aq{x) - j d?x' G(f , f d'^Akix') w 

with —G^""^^ da db G{x, x') = 6{x — x'), or more explicitly, 

G{x, x') ^ 



(19) 



(20) 



(21) 



(22) 



(23) 



47rv^-Mab(x" - x'")(x'' - x'^) ' 
The expression under the square root is non-negative ultimately due to the energy distinguishing 



property (see appendix |A[). Consistency of the gauge (22) with the temporal equation (20) requires 
that the last constraint 



,4 = GO"°^9aAbP.O. 



(24) 



In summary, our constraints 0/ are given by 



bi=TT'^ ^ 0, 

h = 5avr« « 0, 



63 = ^o(^) - / d^x' G{x, x')G^^^^&a d'cMx') ~ 0, 

64 = 00-^0^9,^-0, 



(25) 



and satisfy 



{0/(t,f),0j(t,y)} 



(2^ 







1 



-1 

-G^'^'^^kah 




Jk.{x-y) 



(26) 



G^'^^^kah 

The matrix above {0/(t, x), 0j(t, y)} is invertible, so that the constraints 0/ are now of second 
class and the gauge freedom is gone. Its inverse ({0(x), 0(y)}~^)^'^, defined through 

d'ymx)Ajm {{m,mry = 5f5{x-z), (2?) 
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is simply given as 



{<Pl{t,x),<Pj{t,y)}-^ 



(27r3) 












1 







1 



G^'^^^kakb 











Jk.{x--y) 



(28) 



Equipped with equation (28) we can now follow Dirac's procedure and replace the standard Poisson 



bracket {, } by the Dirac bracket {,}di which is defined as 



{A{x),B{y)}D = {A{x),B{y)}- J z d'w {A{x), 4>i{z)}m^, (t>{w)r'y' {(t>j{w), B{y)}. (29) 

Thus we arrive at the fundamental Dirac brackets of our system, with respect to which the theory 
must be quantized 

d^k 



(2vr)= 



■ ^ _ , . _ 6^6^ kg km G'"'^^ _ 5l5^,G^-^-kak, 

" "0 G^P^^kpkq G^P^^k^ka 



{Ao,{t,x),7rl'{t,y)}D = 
{Ac,{t,x),A^{t,y)}D = 0, 
{7T^{t,x),n^it,y)}D = 0, 



and the dynamics of the system is simply generated by the Hamilton equations 



Jk.{x-y) 



(30) 



dtAo,it,x) « j dS{Aait,x),'Hoiy)}D 
5t^"(t,x) « [ d^y{TT^{t,x),no{y)]D 



(31) 



where, due to the use of Dirac brackets, only T-Lq is involved. 



5. BIHYPERBOLIC AREA METRICS CLOSE TO LORENTZIAN METRICS 



The preceding Hamiltonian analysis and calculation of Dirac brackets made only implicit use 
of the requirement that the area metric background be bi-hyperbolic and energy-distinguishing, 
namely in the abstract constructions underlying the definition of spacetime foliations into spacelike 



leaves. But now we need to explicitly solve the field equations (21) with the gauge imposed by 



( 22 ) , and this requires to restrict attention to concrete bi-hyperbolic and energy-distinguishing area 



metric backgrounds. Moreover, for actual calculations it is most convenient to choose a coordinate 
frame in which the area metric takes a simple normal form. The normal form theory of area metrics 
in four dimensions has been developed in [S], and used in |3j to show that the area metric cannot 
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be bi-hyperbolic unless the endomorphism J on the space of two-forms defined through 

j^ap ^ G^'^-u:^,^^ (32) 

has a complex eigenvalue structure (Segre type) of the form [llllll], [22ll], [33], [llllll], [2211], 
[111111] or [mill]. However, four- dimensional area metrics that are induced by a Lorcntzian 
metric automatically lie in the first class, [llllll], and moreover the continuous dependence of the 
eigenvalues of an endomorphism on the components of a representing matrix implies that any area 
metric in the neighborhood of such a metric-induced area metric is equally of class [llllll] . Thus 
area metrics of immediate phenomenological relevance are clearly those of this first class, and it 
can be shown that by GL(4) frame transformations these can always be brought to the form 

-a p 
0-/3 (7 
-7 r 
/9 a 
(7 /3 
T 7 

where for notational purposes, G is considered here as a bilinear form on the space of two-forms for 
convenience, and the representing matrix shown above is with respect to the obvious induced basis 
in the order [01], [02], [03], [23], [31], [12]. The positivity of a , /3 , 7 follows from our convention that 
P is positive on the hyperbolicity cone C. If and only if the area metric is induced by a Lorentzian 
metric, do the real scalars assume the values a = /3 = 7 = 1 and p = cr = r = 0. So any finite (but 
not too large) deviation from a Lorentzian metric is encoded in these scalars and the frame that 
brings about this normal form. 

It is straightforward to show that if one chooses p = cr = r, the polynomial 



for real p, cr, r and 
real positive a, /3, 7 , 



(33) 



2 2\ 



P{q) = a(3jiq^ + qt + ql + ql) 
+a{P^+j'){qlql 
+P{a'+j')iqfql-qiql) 
+7{a' + P'Mql - qhl) 



(34) 



associated with an area metric of this class is hyperbolic with respect to h = L'^ip/dt). This is 
most efficiently verified in the normal frame by observing that for h = (1, 0, 0, 0), the real symmetric 
Hankel matrix Hi{Pq^h) associated with the polynomial Pq^h is positive definite for any covector q, 
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which imphes that P is hyperboHc |331 134j . The dual polynomialm takes precisely the same 
shape in the normal form frame employed here, and thus is also seen to be hyperbolic. Finally 
also the energy-distinguishing property is easily checked. Finally, note that for area metrics with 



polynomial (34), we have 



Qahc p gOafec 



(35) 



in this normal form frame, which significantly simplifies the field equations (20) and (21) whose 



solutions we will now be able to obtain, orthogonalize appropriately, and thus obtain a diagonal- 
ization of the Hamiltonian. 



It is worth noting that the hyperbolic polynomial (34) only factorizes if at least two of the 



scalars a, /3, 7 coincide, so that area metrics with a bi- metric dispersion relation merely present a 
subset of measure zero within the set of area metrics neighboring Lorentzian metrics. Indeed, for 
the generic case of mutually different scalars, the polynomial P is irreducible. Thus theories trying 
to account for birefringence in linear electrodynamics by some sort of bi-metric geometry fail to 
parametrize almost all relevant geometries near Lorentzian metric ones. 

6. DIAGONALIZATION OF THE HAMILTONIAN 



In order to diagonalize the Hamiltonian (10) for bi- hyperbolic and energy-distinguishing general 



linear electrodynamics with a higher-order polynomial dispersion relation given by (34), we first 



need to find the solutions of the classical field equations (20) and (21 ). After choosing the Glauber 



gauge (25), the first equation is trivially satisfied, and the second one reduced to 



(36) 



due to (35). Moreover, these field equations are completely equivalent to the field equations arising 



from (31). Specifically, we look for plane wave solutions 



(27r)2 



j(ajt+p.x) 



SO that introducing (37) into (36) we observe that the equation 



must be satisfied if (37) is indeed a solution. Equation (38) has non-trivial solutions only if 



det {(f^^'^[u^f + G^-paV^ = . 



(37) 



(38) 



(39) 
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The non-zero frequencies uj for which this is the case are precisely the solutions of P{uj,p) = 0, 



compare ( A6 ) . From the energy distinguishing condition of an area metric spacetime it follows that 



this frequencies are non-zero unless p = 0, and real because of the hyperbolicity of P. It is then 



further immediate from (34) that if some (without loss of generality positive) uj{p) is a solution for 
some given p in our normal frame, then so is —uj{p), and that uj{p) = uj{—p). Thus we have four 
non-zero energy solutions ±uj^ (p) labeled by / = 1, 2, two positive and two negative ones, for each 
spatial momentum p. Therefore any solution of the field equations for the real gauge potential A 
can be expanded as 

1=1 2 Jn'"^°°^^ 

where strictly speaking, the integral is to be taken only over spatial momenta p for which the 
roots CO of P{u},p) are non-degenerate, so that the elementary plane wave solutions are linearly 
independent. However, the set of covectors for which these zeros are degenerate is of measure zero 
|3], so that this restriction of the integral domain can be technically disregarded. It may be worth 
emphasizing that the standard appearance of this expansion is somewhat deceptive, since the 



appearing here are solutions of (39), rather than the standard Lorentzian dispersion relation. 

Having obtained a basis of solutions of the classical field equations, we now identify an inner 
product that is preserved under time evolution and positive definite for positive energy solutions. 
To this end, consider solutions Aa{p){t,x) and Aa{q){t, x) of the field equation for specific spatial 



covectors p and q, respectively. Using the field equation ( 36 ) , it can be shown that the continuity 
equation 

do (GO"O^A:(p)'5'oA(gl) + dm (-G-^'^^^'Al{p)'^nAt{q)) = (41) 



is satisfied. This implies that we have a conserved charge Q given by 

Q = I d^xG^-''A:{p)'^oAb{q). (42) 

The above defined charge Q can be used to define a scalar product in the space of solutions, which 
then by definition is conserved under time evolution and is defined as {A{p),A{q)) = —iQ. It 
satisfies the following properties 

{Aip),XAiq)) = X{A{p),Aiq)) 

{XA{p),A{q)) = X*{A{p),A{q)) (43) 
{Aip),A{q)) = iAip),Aiq)r = -iA*ip),A*iq)). 
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Hence, if we define for our different frequency solutions 

Fiim ^) = e-^^-'^^'+P-'^fi{p), (44) 
we find that {F^ (p), F*-^ (q)) = and 

iF'ip),FJ{q)) = -{F*\p),F*Jiq)) = -2u' {p)G^-^^ fi* {p)fi{p)5'J 5{p - q). (45) 
In the derivation of the above results we used charge conservation to find that for I ^ J 

G^-^^fi*{P)ff{-p) = G^''^'f^*{p)fi!ip} = 0. (46) 



Moreover, since C^*^"^ is negative definite due to (33), equation (45) shows that the positive energy 



solutions can be positively normalized, implying in turn that the negative energy solutions are 
negatively normalized. This indefiniteness of the scalar product is responsible for creation and 
anhilation processes. Choosing, without loss of generality, fi(p) = ""^f^ , we finally have 

{F\p),F\q)) = -{F*\p),F*\q)) = -G^'''''ai*ip)aiip)S'' 5{p - q), (47) 

and our general solution reads 

Aa{t,x) = j (alp {e-^^-"^'^'^'^-'^ai{p) + e-(-^(^-^*+^^-%:^(p-)) . (48) 

Now that we have the general solution ( |48[ ) , we can use it to write the Hamiltonian evaluated at a 
solution in diagonal form, 

Hq = j d^xHoix) = -\j d'xGO'^o'' {doAadoAk - AadlA^) (49) 



2 ^ J (27r) 

7=1,2 ^ ' 

The last expression shows that the classical Hamiltonian is positive because C^""^ is negative 
definite. 

7. QUANTIZATION 

Equipped with the results developed so far, we are now ready to quantize the electromagnetic 



field. First, notice that if we multiply equation (38) by pi then the amplitude eigenvectors a\^{p) 
satisfy 

G'^'^^aalip) = 0, (50) 
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such that the constraints G^°'^^daAb w and dair'^ ~ are satisfied. Now it can be shown [3] that 
for almost all spatial momenta p, the two associated positive energies do not coincide, uj^^^{p) ^ 



1=2 1 



(p), so that the covectors (p) and ay (p) are determined up to scale, linearly independent 
and thus forming a basis for the space of all purely spatial covectors v for which G^^-^^paVh = 0, and 



diagonalize the Hamiltonian (49). Thus the only one freedom we have is a choice of normalization, 
which we choose such that any solution aj^{p) is expressed as a^{p) = ci^{p)^l{p) with the covectors 
el{p) normalized with respect to our scalar product, i.e., 

-G'^''ei'{p)ei{p) = l, (51) 

where there is no summation over /. Furthermore, pa and any a^(p) are clearly linearly independent, 
such that the set of covectors 

P 1 



ei=Hp),ei=\p), 



1=2, 



aOfe 



PaPb 



(52) 



constitute a basis for V ^ which is orthonormalized with respect to the scalar product (43). Hence, 
they satisfy the completeness relation 

-'tPs 



(53) 



Notice that the normalized covectors (p) satisfy the orthogonality identities ( 46 ) . Now the general 



solution (48) takes the form 
^ I 

7=1,2 



Aait,x) 



{p)t+p.x) I 



a\p)ei{p) + e 



i{uj' {p)t+p.x) I* I 



a'*{p)e*J{p)) ,(54) 



(2vr)3 

where the coefficients (p) correspond to the amplitudes of the solutions and depend on the initial 
values that one considers for a specific problem in the classical approach. At the quantum level, 
these amplitudes are precisely the mathematical objects that should be promoted to operators, 
such that the corresponding quantum field reads 



Mt,x) = [ 

T 1 O 



d'^p 



1 



7=1,2 ' 



-i{uj' {p)t+p.x) --I 



a {P)^a{p) + e 



d'Hp)Cip) 1(55) 



(2^)3 y2i7M 

Using this quantum solution and the expressions for the energy and spatial momentum (which can 
be obtained by calculating the energy-momentum tensor) we find that the quantum Hamiltonian 
and quantum spatial momentum operators are given by 



Pi 



E 

7=1,2 

E 

7=1,2 



d?p 
(2^ 

d?p 



u\p) [a\p)d'\p))+a'Hp)a\p)) 
a\p)a'Hp))+a'\p)a\p)) 



3 Pi 



(56) 
(57) 
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Hence, if we identify the operators a^{p), a^^{p) with annihilation and creation operators respec- 
tively, a condition for the Hamiltonian to be positive definite is that these operators obey the 
bosonic commutation relations 

[a\p),a''\q)] = (iTrfd'-" Sip - q) , (58) 
[a'ip),aJ{q)] = [a'Hp),a'\q)] = 0. 

Hence, the quantum Hamiltonian operator can be written as 

^0 = E / 7^^'(p1a'^(p)a^(p)) + E ^ / d'pu:'{mO), (59) 

1=1,2'' ^ ' 1=1,2 

from which expression we identify the energy of the electromagnetic vacuum, which was calculated 
here for plane wave solutions without any boundary conditions, as 

Svac(no boundaries) = ^ ^ j d^puj^ {p)S{0) . (60) 

1=1,2 

In the next section, we will calculate how this expression changes if one imposes boundary condi- 
tions. Finally, by using the completeness relation (|53|) one confirms that 



Sp 

(2^)3 



PmPaG' 



OmOfe 



G^'^'^^PrPs 



eip-{x-y)^ (61) 



which shows the consistency of the quantization procedure with the Dirac brackets ( 30 ) , since the 



latter reduce to the above form due to ( 35 ) . 



8. APPLICATION: CASIMIR EFFECT IN A BIREFRINGENT LINEAR OPTICAL 

MEDIUM 



The Hamiltonian ( 59 ) shows that the quantization of general linear electrodynamics leads to a 
modified quantum vacuum compared to standard non-birefringent Maxwell theory. In fact, local 
physical phenomena which only depend on the quantum vacuum can be used to test and bound the 
non-metricity of spacetime. In this section we analyze one such phenomenon, namely the Casimir 
effect; similar studies can be conducted for the Unruh effect and spontaneous emission. 

The Casimir effect [35] arises because of the energy cost incurred by imposing boundary condi- 
tions on the electromagnetic field strength. Physically, such boundary conditions arise for instance 
by introducing perfectly conducting metal plates into the spacetime. For two infinitely extended 
plates parallel to the 1-2-plane, and this is the configuration we will study here for general linear 
electrodynamics, the electromagnetic field strength must satisfy the boundary conditions 

^Ollplates = ^02|piates = -^islplates = ^ (62) 
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everywhere on either plate; this follows, by Stokes' theorem and thus independent of the geometric 
background, from the physical assumption that the plates are ideal conductors inside of which the 
field strength must vanish. 

Now the key point is that having, or not having, boundary conditions for the vacuum amounts 
to an energy difference, the so-called Casimir energy 

-Ecasimir = -Evac(pla'te boundaries) — £'vac(no boundaries) . (63) 

But both energies on the right hand side diverge and need to be regularized such that their 
difference is independent of the regulator. This is most easily achieved by first considering boundary 



conditions analogous to (62), but for all six faces of a finite rectangular box with faces parallel to 
the coordinate planes, and separated by coordinate distances Li,L2,L3. In a second step we 
will then push all faces a very large coordinate distance L apart in order to obtain an expression 
for i?vac(iio boundaries) regularized by L, and similarly push all but two faces in order to obtain 
a corresponding regularized expression for E'vac (plate boundaries). The difference of these two 
regulated quantities will indeed turn out to be finite per unit area and be independent of the 
regulator L. 

Now more precisely, a basis of solutions of general linear electrodynamics satisfying the box 
boundary conditions is labeled by a triple {ni,n2-,n^) of non-negative integers and a polarization 
1 = 1,2 and takes the form 

Ax{x) = a^(ni,n2,n3)cos(ni7r^)sin(n2vr^)sin(n37r-^) , 

Li L2 -L3 

Ay{x) = a^(ni,n2,n3)sin(ni7r^)cos(n27r^)sin(n37r^) , (64) 

Li L2 Li 

Az{x) = 02(71-1, n2, ns) sin(ni7r^) sin(n27r-^) cos(n37r-^) , 

Li L2 -L3 



where the a^(ni,?7-2,n3) are solutions to equation (38) for aj^(ni7r/Li, n27r/L2, 're37r/L3), which 

always exist if the dispersion relation is bi-hyperbolic and energy distinguishing. The vacuum 

energy in the presence of the box boundary conditons is thus given by the discrete sum 

^ 00 

S^ac(box boundaries) = a;'^(7r — , vr — , vr — ). (65) 

n=0 1=1,2 ^ ^ 

Removing appropriate faces to a coordinate distance L one finds from this, in the very large L 
limit, the L-regularized expression for the vacuum energy without boundary conditions 

E^^^{no boundaries) = — ^ / d^puj^{p), (66) 
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and the L-regularized expression for the vacuum energy in the presence of two plates parallel to 
the 1-2-plane and separated by a coordinate distance d 

L2 



-^vac (plate boundaries) = ^ XI H / dp^dpyU^ {pI,pI, 



~d' 



(67) 



where the prime in the summation symbol n means that a factor 1/2 should be inserted if this 
integer is zero, for then we have just one independent polarization. Hence we find for the physical 
vacuum Casimir energy Uid) = (£'vac(plate boundaries) — £'vac(no boundaries) )/L^ per unit area 



U{d) 



^ Yl Y[ dpccdpyu^ (pl,pl,{^f^ - f dp:,dpydp^oj\pl,pl,pl) 

1=1,2 L n' •'^ 



(68) 



In principle, the execution of the above integrals can proceed as in the standard case. However, with 
the frequencies uj^ now being solutions to a quartic, rather than quadratic, dispersion relation, these 
integrals are much harder particularly due to the absence of rotational invariance. Fortunately, 
the fact that contributions from the two different polarizations 7 = 1,2 are simply added in the 
above expression allows for an analytic study of the case where the polynomial P is reducible. In 
terms of the scalars a, /3, 7, p defining the area metric in a normal form frame, this is the case if 
and only if two of the scalars a, /3, 7 coincide, and we may take a = /3, for instance. Even in this 
simplest of non-trivial cases, the Casimir energy crucially depends on the birefringence properties 



of the underlying general linear electrodynamics. More precisely, the polynomial in (34) factorizes 
into two Lorentzian metrics. 



P{p) = a{apl - apj - -/{pj +P2))(7Po " TPs " «(Pi + P2)) 



(69) 



so that we immediately obtain the positive energy solutions 



UJ 



.1=1 



a 



[apl + -fpl + -fpf) 



1/2 



and 



.1=2 



1 



(jpI + apl + apl) 



1/2 



(70) 



turning (68) into a sum of integrals as they appear in the standard Casimir problem on a Lorentzian 



background. Thus from here on the standard calculation of the Casimir effect |36] can be followed 



for each of these integrals separately, and one finally obtains the Casimir energy ( 68 ) 

Tjr A I f a 7\ TT^ 

^(^) = -2U + aJ^- 
This energy difference of course results in a Casimir force 

F{d) = -U'{d) -- 



(71) 



1 / a 7 \ TT^ 

2 V7 ^ aj 240cF 



(72) 
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between the plates. The standard Casimir force is recovered if and only if a = /3 = 7, and irrespec- 
tive of the value of the scalar p. This in turn is equivalent to the absence of classical bi-refringence 
j37j . Note that the amplification of any bi-refringence is limited only by the technological constraint 
of how small the separation d between the plates can be made in any realistic set-up. In contrast to 
classical bi-refringence tests, which usually require accumulative effects over large distances (with 
all the uncertainties present in such non-local measurements), one sees here that the Casmir force 
allows for a detection of bi-refringence by way of a highly local measurement. Conversely, of course, 
experimental measurements of the Casimir force agreeing with the standard prediction within the 
given technological constraints can be used to put stringent bounds on the non-metricity of the 
spacetime region where the measurement is conducted. 

9. CONCLUSIONS 

The canonical quantization of general linear electrodynamics, as undertaken in this article, 
required the solution of several, and in themselves challenging, questions. 

First, from the classical field theory point of view, it had to be clarified which general linear 
electrodynamics are predictive on the one hand and physically interpretable in terms of quantities 
measurable by observers on the other hand. The answer to both questions is encoded in the 
polynomial dispersion relation of the field theory, and amounts to the simple algebraic conditions 
that the latter be bi-hyperbolic and energy-distinguishing. Further down the road, these conditions 
turned out to be crucial in ensuring the existence of a Glauber gauge, which allowed to define a 
time-conserved scalar product in the space of classical solutions, on which all further developments 
were based. 

Second, and closely related, is the construction of a Hamiltonian formulation of general linear 
electrodynamics. The causal structure encoded in the higher-order polynomial dispersion relation 
of this theory required a revision of the construction of suitable spacetime foliations that underlie 
a Hamiltonian formulation. The key point here was that the leaves of the foliation must be such 
that initial data provided on them must be causally evolved by the field equations and at the same 
time be accessible to observers. It turned out that bi-hyperbolic and energy-distinguishing area 
metric manifolds provide precisely the structure to ensure both, and ultimately render the classical 
Hamiltonian positive. 

Third, the quantum Hamiltonian operator is positive definite. For a theory with a higher-order 
polynomial dispersion relation this is far from trivial, and again only due to bi-hyperbolicity and the 
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energy-distinguishing property. The positive definiteness of the quantum Hamiltonian operator is 
inherited from the positivity of the classical Hamiltonian because the positive energy solutions have 
positive norm with respect to the scalar product identified before. This is of course synonymous 
with the stability of the quantum vacuum, and thus the physical relevance of the Casmir effect we 
derived from it. 

The wider lesson learnt from our study consists in this being a prototypical, and rather non- 
trivial example for the quantization of a field theory with a modified dispersion relation. Such 
theories are discussed extensively throughout the literature with a number of motivations, but 
usually disregarding the fundamental consistency conditions that were instrumental in this work. 
In particular, the classically inevitable condition that the dispersion relation be given by a bi- 
hyperbolic and energy distinguishing polynomial proved inevitable also at virtually every step of 
the quantization process. 

Actual calculations were made tractable by employing the fact that the dispersion relation 
of general linear electrodynamics is ultimately determined by a fourth rank area metric tensor 
for which a complete algebraic classification and associated normal forms are available for the 
phenomenologically directly relevant case of four spacetime dimensions. This normal form theory 
was also used to ensure that the birefringent optical backgrounds for which we calculated the 
Casimir effect (and which owe their physical relevance to their parametrizing the neighborhood 
of non-birefringent optical media) indeed are bi-hyperbolic and energy-distinguishing. While it 
is possible to directly exclude 16 out of a total of 23 algebraic classes of four-dimensional area 
metrics as admissible spacetime structures, a complete and simple characterization of all area metric 
manifolds that are bi-hyperbolic and energy-distinguishing however remains an open problem. The 
high interest that would attach to a comprehensive solution of this problem is clearly underlined 
by the pivotal role we saw these conditions to play for the classical and quantum theory alike. 

Another open, albeit well-defined, problem is the coupling of fermions to general linear electro- 
dynamics. The issue is the very definition of spinors in the presence of a higher-degree polynomial 
dispersion relation, rather than one given by a Lorentzian metric. For rather than satisfying the 
standard binary Dirac algebra, generalized Dirac matrices that intertwine spacetime and spinor 
indices must now satisfy a quarternary algebra determined by the fourth-degree polynomial as- 
sociated with a four-dimensional area metric spacetime structure. Even employing the normal 
form theory, representations of this quarternary algebra appear hard to find in any other but the 
case of a reducible dispersion relation satisfying the relevant conditions (which then leads to a 
sixteen-dimensional spinor representation with an associated refined Dirac equation for this special 
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bi- metric case). Once a representation in the general case is obtained, the canonical quantization 
can proceed exactly along the now clearly defined path for such theories, and complete a full theory 
of general linear quantum electrodynamics including charges. 

Concluding, we see that the results of this article open up the arena for comprehensive, and 
above all conceptually watertight, studies of quantum effects brought about by birefringence. In- 
deed, beyond the Casimir force we calculated here explicitly, any other effect rooting in the quantum 
vacuum of electrodynamics can be directly calculated now on the basis of the technical findings of 
this paper. This includes for instance the Unruh effect or the spontaneous emission of photons from 
quantized point particles. Once spinor fields are included, the range of effects of course extends 
to the full spectrum of processes discussed in standard quantum electrodynamics with charged 
fermions. Far from being merely academic musings, however interesting, these findings are of im- 
mediate relevance to physicists with interests ranging from fundamental theory to material science. 
Indeed, while on the one hand directly testable in birefringent optical media in laboratory exper- 
iments |l9j , the constructions of this paper on the other hand also put phenomenological studies 
of modified dispersion relations |10ft27j (or, equivalently, Lorentz-violating spacetime structures 
|38] - [i7] ). as they now abound in the literature, on a solid theoretical footing. 



Appendix A: area metrics and hyperbolicity 

An area metric in four dimensions takes the following block matrix Petrov form [8] 

^[q/3][75] __ 



M K 

K'^ N 



(Al) 



where the antisymmetric index pairs [01], [02], [03], [23], [31], [12] label, in this order, the basis in 
which the matrix is given. The matrices M, K, N are 3x3 matrices are related to the area metric 
through 





G 


= 


1 

— ( 
2 




1 


Nab = 


4' 



OaOb 



(A2) 



^Oamn 



where e is the totally antisymmetric Levi-Civita symbol. If M in the expression above is invertible, 
which at the end of this section we will see to be the case if the correspondig area metric leads to 
well-posed field equations, then det(A^ — K^MK) ^ 0. This ensures the existence of the object 
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Tpqtu defined in (18), which can be shown to be explicitly given by 



' abtu 



-2etufeabm {{N - K^MK) 



(A3) 



The principal symbol of the linear field equations governing the dynamics for the electromagnetic 
field on area metric backgrounds was found in [8j as the determinant of the 6x6 matrix 



GOmOn„ n/'^0(mn)a„ 1^ /^efma^ 

PO — ZLr 'Pa —i^enef^ ' Pa 



e Pa o^po 



(A4) 



Using that for any n x n matrices A, B, C, D 



det 



A B 

C D 



det{AD-BC) if CD = DC, 



(A5) 



we can write the determinant of (A4) as the determinant of a 3 x 3 matrix as 



det{Ayc.) = det (g0™°>2 _ 2GO(-")>,po + C"paPu) = -plP{p) 



(A6) 



where P is precisely the polynomial given in equation ([s]). Using (A6), the polynomial P can now 
be expressed in terms of the constitutive matrices M, K, N . After calculation one finds 



Pg{pq,P) = apl + b{p) pI + c{p) pI + d{p) po + e( 



(A7) 



with coefficients 



a = -det(M'^^), 

b{p) = eabcGfl'^K^pm, 

c{p) = - [Nrf G'IP^ + eabce'^^^'K^KM"' + 2i^^K^M"]' + 2KiK^^M"'^'^ pipm , (AS) 

d{p) = -2e''^ (2Ar^,K[™M«]' - K'^'KiKi) pip^Pn , 

e{p) = -N.je^'^'ef^'l^-M^'^Nrs-KKi^puP^p.Pn, 

where C^^^ = M°-^M"^'^ — M°-"-M"^^. Using this expression, we find that a necessary condition 
for the well-posedness of the initial value problem is that the matrix M"^ is invertible. Moreover, 
for an observer's frame, in which the coefficient b{p) = as explained in [3|, the matrices K and 
M satisfy 



j^[-jVf"]/ = 0. 



(A9) 
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In this case, equations (A8) reduce to 



a = -det(M"''), 

Kp) = 0, 

c{p) = - (^Nrf G'[P^ + Cabce'^^'-KiKM'^^ + 2K^,K^J^M-V^ pipm , (AlO) 

dip) = -2e''^ (2NeaKl!^M'^V _ K^KiK^) pip^pn, 

e{p) = -N^fe^^ef^' Qm^^AT,, - K^K^^ PuPmPvPn- 

For area metrics for which there exists a frame such that G^"''"^ = pe^"'^'^, such as those considered 
from section ([s]) onwards, or equivalently = 4)6^, the polynomial Pg{p) is further reduced to 

Pg{P0,P) = -p^det(M) - pl{NrnnGTInP^Pj " \{M''p^P,){G'''''' N,,pm) (All) 
= -pldet{M) - pliNmnGZnPiPj - detiN)iM'^p,p,){N-'fpkPi. 

From the energy distinguishing property of the area metric spacetimes considered here it follows 
that P{po,p) = does not have any solutions po = unless p = 0. But then the matrix M""^ must 
be of definite signature. For suppose that this is not the case, then one could find p ^ such 
that M^^piPj = 0. That would imply extra zero solutions for pQ, in contradiction to the energy 
distinguishing condition. The same holds for the matrix A^. Thus, without loss of generality, we 
assume that M is negative definite; then using Descarte's rule of signs, hyperbolicity of P implies 
that A^ must be positive definite. The opposite definiteness of M and A^ can be shown to be also 



sufficient for the hyperbolicity of (All). This is indeed the case for class I area metrics (33) with 
p = a = T. 
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